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Flavor- violating interactions involving new heavy particles are among proposed ex- 
planations for the ti forward-backward asymmetry observed at the Tevatron. Many 
of these models generate a ti-plus-jet signal at the LHC. In this paper we iden- 
tify several new charge asymmetric variables in tij events that can contribute to 
the discovery of such models at the LHC. We propose a data-driven method for the 
background, largely eliminating the need for a Monte Carlo prediction of it-plus-jets, 
and thus reducing systematic errors. With a fast detector simulation, we estimate 
the statistical sensitivity of our variables for one of these models, finding that charge- 
asymmetric variables could materially assist in the exclusion of the Standard Model 
across much of the mass and coupling range, given 5 inverse fb of data. Should any 
signal appear, our variables will be useful in distinguishing classes of models from 
one another. 
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I. INTRODUCTION 

The most peculiar among the Standard Model fermions, the top quark has challenged 
the high energy physics community, both on the experimental and theoretical level, since 
its discovery in 1995. From the theoretical viewpoint, its exceptional mass suggests that it 
might play a special role in the mechanism of electroweak symmetry breaking. This occurs 
in a number of proposed theories, including Little Higgs and Top-color Assisted Technicolor, 
and even within many supersymmetric models. On the experimental side, the predictions 
of the Standard Model (SM) for the top quark are still not fully tested. At the Tevatron, 
the high production threshold limited the number of ti events, and only now at the LHC 
will it be possible to perform precision measurements of the top quark's properties. 

While most aspects of the top quark agree so far with SM predictions, both the CDF 
[U [3] and DO [21 H] collaborations have reported an anomalous forward-backward asymmetry 
for ti pairs at intermediate to high invariant mass, much larger than expected from SM 
calculations [5T4TU]. This result, which relies upon "forward" being defined relative to the 
Tevatron's proton beam, cannot be immediately checked at a proton-proton collider such 
as the LHC. However, it is well-known that forward-backward asymmetries at a proton- 
antiproton machine lead to differential charge asymmetries at a proton-proton machine, and 
indeed, a differential charge asymmetry in ti production, as a function of the t quark's 
rapidity, should be observable. This quantity has been discussed by theorists, for instance 
in p~5HT8l I5H 175]. and has been measured at the LHC experiments [T2HL1] . The statistical 
errors on this measurement are still rather large, however, and meanwhile the LHC's higher 
energy allows its experiments to probe for related phenomena in other ways. 

No significant problems with the SM calculation or the experimental measurements of the 
anomalously large asymmetry have been found. Meanwhile, a variety of models have been 
proposed to explain it. Most of these produce the asymmetry through the exchange of a 
new particle, either an s-channel mediator with axial couplings to both top and light quarks 
[I9H39] . or a t-channel (or u-channel) mediator [101 - 159] with flavor- violating couplings that 
convert a light quark or antiquark to a top quark. Both processes are illustrated in Fig. [TJ 
In [60H7T] . comparisons between different models are carried out, and study of those models 
or measurements in the LHC context can be found in [72HS3]- 

Charge asymmetries at the LHC are known to be powerful tools for searching for and 
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studying new physics, and recently this has been put to use in the context of models for the 
ti asymmetry. In [52] a large overall charge asymmetry was used to argue the Shelton-Zurek 
model [13] was most likely excluded; a similar method was then applied for a different model 
in [HI]. Here, we focus on models with t- or w-channel mediators, which, as we will see, often 
generate large charge asymmetries in tij (top plus antitop plus a jet) at the LHC. These 
asymmetries, a smoking gun of this type of model, will be crucial for a convincing discovery 
or exclusion of this class of models. Note these asymmetries are not directly related to the 
Tevatron forward-backward asymmetry in ti events, which translate at the LHC into the 
differential charge asymmetry in t production mentioned above. The asymmetry in tij that 
we study here stems from a completely different source; see below. 

Any of the models with a t- or u-channel mediator has a coupling between a light quark 



or antiquark, a top quark, and a new particle X, as in Fig. 1(b). It follows that the X 
can be produced from an off-shell quark or antiquark in association with a t or i, as shown 
in Fig. [2j Consequently, as has been pointed out by many authors [121 HH EH EEJ E21 ED] , 
it is important at the LHC to look for the process pp — > Xt (and the conjugate process 
pp — > Xi), where X in turn decays to i plus a jet. A straightforward search for a t+jet 
resonance can be carried out, though it suffers from the poor resolution for reconstructing the 
resonance, large intrinsic backgrounds whose shape may peak near the resonance mass, and 
combinatoric backgrounds in the reconstruction. Alternatively, one could attempt a cut-and- 
count experiment; with appropriate cuts one can obtain samples in which the X production 
contributes a statistically significant excess to the tij rate. But the tij background is not 
simple to model or measure, and systematic errors may be problematic. 

Fortunately, the process shown in Fig. [2] has a large charge asymmetry. The difference 
between quark and antiquark pdfs assures that the rate for X production is different from 
that of X production. (If X is self-conjugate, same-sign top-quark production results, and 
is readily excluded [88, 8!)*: we therefore assume that X 7^ X.) Our approach in this work 
will be to suggest something a bit more sophisticated than a simple resonance search, using 
the charge asymmetries of these models to reduce systematic errors at a limited price in 
statistics. We will also propose other charge-asymmetric variables that can serve as a cross- 
check. As a by-product, should any discovery occur, the asymmetry itself can serve as a 
diagnostic to distinguish certain classes of models from one other. 
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FIG. 1. Diagrams that can lead to a forward-backward 
asymmetry at the Tevatron in tt production. The X is 
exchanged either (not both) in the s— or t—/u— channel, q 
may be u or d. 

II. BENCHMARK MODELS 
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FIG. 2. For a t- or u-channel 
mediator X, direct produc- 
tion of tX (followed by X — > 
t + q or q) is always possible. 



As our benchmark model, we take a typical model with a i-channel mediator, a colorless 
charged spin-one particle which we call a W. We will assume the W couples a right-handed 
d quark to a t quark. While a theory with only these couplings would be inconsistent, we 
will assume this coupling generates the largest observable effects. One may say that we 
choose a "simplified model", or "model fragment", in which this coupling is the only one 
that plays an experimentally relevant role. We will see this point is not generally essential]]] 
The Lagrangian we take for our simplified model is simply 

C = -g R W'+t^P R d + h.c. (1) 

where P R = (1 + 7 5 )/2. 

We are interested in the process in which the W contributes to a ttj final state. One 
contribution comes from dg — > tW'~ and its conjugate dg — > iW' + , following which the W'~ 
decays to id and the W' + decays to td. We will refer to this as "s-channel production" 
(see Fig. [3]). The W also contributes to dg — > tid, and similar processes, through t-channel 
exchange (see Fig. 

The cornerstone of our analysis is the observation that in the s-channel process, the 
negatively charged W is produced more abundantly than the positively charged W, because 



1 Attempts to make consistent models with a W include [57]. There are also attempts to include the 
coupling of a W with a u and b quark |48j , but such couplings lead to a large charge asymmetry in single 
top production [S5] , now excluded by LHC data [55] [5S] . 
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FIG. 3. Dominant production mode for the W' . The cross-section for W'~ is much larger than for 
W'+. 
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FIG. 4. Characteristic examples of diagrams that contribute to tij production involving the W in 
t-channel exchange. 

the negative W can be produced from a valence quark, while a positive W requires a sea 
antiquark in the initial state. (See Fig. [3j) 

The processes in Figs. [3] and [4] can in principle have non-trivial interference with the Stan- 
dard Model background — a point which considerably complicates background simulation. 
But we have found that interference is not numerically important for certain observables, at 
least with current and near-term integrated luminosities. All results in this paper therefore 
ignore interference; however, with larger data sets, or when studying other models and/or 
using other variables, one must confirm on a case-by-case basis that this approximation is 
sufficiently accurate for the analysis at hand. 

In [17] , the authors studied this model and fitted it to the ti asymmetry and total cross- 
section in CDF. (This was done prior to the DZero result that shows a smaller asymmetry 
with less energy dependence.) Based on this work, we will take six benchmark points shown 
in Table |TJ with three values of the W mass and two values of gn for each mass, a larger 
value that would reproduce the CDF measurement and a value \/2 smaller that would give 
a Tevatron asymmetry (and also an W width and tW production rate) of about half the 
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Mass (GeV) 


9R 


cross-section (pb) 


400 


1.5 


32.2 


400 
600 


1.5 

2 


12.9 
18.2 


600 


V2 


6.3 


800 


2 


6.5 


800 


V2 


2.1 



TABLE I. 7 TeV LHC tree-level cross-sections for the processes shown in Figs. [3] and |4j for the 
various benchmark points. No K-factor is included in these numbers, but we do apply one later in 



our analysis; for a discussion of the simulations and the K-factor, we refer the reader to Sec. IV 



size. The cross-sections at these benchmark points (including all the processes shown in 
Figs. [3] and [4]) are also given in Table [jj 

The W also contributes to ti production through t-channel exchange, and thus to the 
differential charge asymmetry in t rapidity at the LHC (not to be confused with the asym- 
metries in tij that are the subject of this paper.) ATLAS and CMS measurements of this 
quantity (with respectively 0.7 and 1.1 fb _1 of data) [121 EH] may somewhat disfavor the 
benchmark points with the larger values of gn, which (at parton-level, not accounting for 
t reconstruction efficiencies) give a differential charge asymmetry in the 8-9% range. But 
the situation is ambiguous, since event mis-reconstruction and detector resolution produce 
a large dilution factor, which may make this charge asymmetry consistent with the current 
measurements. Our benchmarks with larger couplings thus probably represent the outer 
edge of what might still be allowed by the data. By considering also an intermediate cou- 
pling that still could explain the Tevatron ti asymmetry, we cover most of the interesting 
territory, and permit the reader to interpolate to other values of the couplings. 



III. A MASS VARIABLE 



Among the charge-asymmetric observables discussed in this paper, we will devote most 
of our attention to one motivated by the resonance structure of the W, which we will refer 
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to as the mass variable Mjuw in later content. This variable is applicable universally to 
a wide range of W masses and couplings, and to most other models with tX production. 



We discuss this mass variable in great detail in this section. In Sec. VI, we will discuss the 
azimuthal angle between the hardest jet and the lepton (which we refer to as the "angle 
variable".) A third class of potentially useful variables ("Pt variables"), including the Pt 
difference between the hadronic and the leptonic top quarks or W-bosons, is briefly discussed 
in Appendix [Cj 

We will consider only the semi-leptonic tij events (where one top decays hadronically and 
the other leptonically) , resulting in a final state of 5 jets, a lepton and missing energy. All- 
hadronic decays are not useful for a charge asymmetry, as t and i cannot be distinguished 
in this case, while the fully leptonic decay, though probably useful, has a low branching 
fraction. 

Since it is the s-channel process in Fig. [3j where the W appears as a resonance, that is 
charge-asymmetric, we will focus our attention there. In our later analysis we will impose 
an St cutj^] to improve the signal-to-background ratio. If we put that cut at 700 GeV, the 
fraction of negatively charged W's for the 400, 600 and 800 GeV W is 0.84, 0.87 and 0.86 
respectively. Such an enormous charge asymmetry in production can be put to good use. 

Note, however, that since every event (following the W decay) has a t and a t, either 
of which may produce the lepton, the total numbers of events with positively and negatively 
charged leptons are expected to be roughly equal, up to edge effects produced by cuts and 
detector acceptance. But since negative W's are produced more abundantly, a negatively 
charged lepton is more likely to come from the W decay, while positive leptons tend to 
originate from the decay of the spectator top quark or antiquark. Kinematic features, such 
as the invariant mass and transverse mass of various final-state objects, differ for events 
with negatively and positively charged leptons. For instance, a simple bump hunt aimed 
at reconstructing the W resonance would find a much larger bump in negatively charged 
leptons than in positively charged ones. Here, we will consider the W reconstructed mass 
distribution more completely, noting that the signal remains asymmetric even away from 
the W mass bump, since the total asymmetry must integrate to (almost) zero. 

Another useful kinematical feature is that the hardest jet in tW — > tid production 
commonly originates from the <i-quark, because of the large energy released in the W decay 
For our definition of St, see equation Q in Sec. 



IV 
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and the dissipation of the top quarks' energies into their three daughters. At leading order 
and at parton-level, and with an St cut of 700 GeV, the fraction of events where the hardest 
parton is the <i-quark (or antiquark) from the W is 0.71, 0.82 and 0.82 for a W of mass 400, 
600 and 800 GeV respectively. (Note neither ISR/FSR, hadronization, nor jet reconstruction 
are accounted for in these numbers, which are for illustration only.) We have designed our 
variables to maximally exploit these two kinematic features. 

One conceptually simple approach to seeking the W would involve fully reconstructing 
the t and t in each event, and searching for a resonance in either tj or tj. This has been 
discussed in (121 HH H7J EH1 E2J ED] ■ The challenge is that the combinatoric background is 
large and hard to model, and often peaks in a region not far from the resonance. Charge- 
asymmetries are useful here, because the positive-charge lepton events are dominated by the 
combinatoric background, while the negative-charge lepton events have similar combinatorics 
but a much larger resonance. Comparison of the two samples would allow for the elimination 
of a significant amount of systematic error. 

However, full event reconstruction in events with five jets will have low efficiency, and 
moreover we are neither confident in our ability to model it nor certain it is the most effective 
method. Here we will instead focus on variables that require only partial event reconstruc- 
tion. Of course the experimental groups should explore whether full event reconstruction is 
preferable to the methods we attempt here. 

We will focus on the mass variable M^y: the invariant mass of the hardest jet in the 
event, a 6-tagged jet (chosen as described below), and a TV-candidate reconstructed from the 



reconstruction of the event to form a candidate for the W, assuming it has decayed to a 
leptonj^] In signal events where the hardest jet in the event is a d (or d) from the W decay, 
and the i (or t) from the W produces a lepton £, Mj UW often reconstructs the W resonance. 
The events with an i~ typically exhibit a resonance at the W mass, while those with an 
£ + , in which the W is most often not reconstructed correctly, have a smoother distribution. 
This effect, and the resulting charge asymmetry — with a negative asymmetry near the W 

3 We solve for the neutrino four-momentum in the usual way. Complex solutions are discarded for simplicity. 

When two real solutions exist, the most central W candidate is selected. 

4 Were one to fully reconstruct the ttj events, one could also study the invariant mass of the hadronically- 

decaying top and the hardest jet, which will also differ for positive- and negative-charge lepton events. 
We neglect this variable here because the reconstruction of the hadronic top has low efficiency, but we 
encourage our experimental colleagues to consider if they can increase their sensitivity by including it. 



observed lepton and the missing transverse momentum 
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(a)Mjibw (parton-level) for signal only, shown for 
positive and negative lepton charge. 
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(b) Bin- by-bin signal-only charge asymmetry 



Asymmetry 

l.Or 



0.5 



0.0 



-0.5 



-1.0 



500 . 6g0 * 



700 



800 



M 



jlbW 



(d)Bin-by-bin signal plus background charge 
asymmetry 



(cjMjibw (parton-level) for signal plus 
background, shown for positive and negative 
lepton charge. 

FIG. 5. Parton-level charge asymmetry in the Mjnw variable for a 600 GeV W with gn = 2 and 
an St cut at 700 GeV. The leptonic W boson was reconstructed from its decay products, j\ was 
taken to be the hardest non-6 parton. ISR/FSR and 6-quark selection effects were not accounted 
for here. The sample corresponds to 1.5 fb _1 . 



mass and positive asymmetry elsewhere — are shown for mw 1 = 600 GeV in Fig. [5j Both 
the asymmetric s-channel and the almost symmetric t-channel are included in what we call 
"signal." 

In constructing Mj UW , we reduce the combinatorial background by rejecting 6-jets that 
are inconsistent with forming a top quark with the lepton and the MET (M^ < 155 GeV 
and M^ lv < 175 GeV.) When multiple 6-jets satisfy these criteria, we select the 6-quark for 
which the quantity \Mu — 155 GeV| + \M£ V — 175 GeV| is smallest. The combined efficiency 
of the W reconstruction and the 6 selection is about 45%. 



Meanwhile, we will give evidence in Sec. |V A| that the SM background to this process 
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shows no charge asymmetry in this variable, to a sufficiently good approximation. It is 
crucial for the use of this variable that this is true. 

There are other invariant-mass and transverse-mass variables that have their merits. 
Some require no event reconstruction, including the invariant mass of the hardest jet and 
the lepton (Mju) and the invariant mass of the hardest jet, a 6-tagged jet and the lepton 
(Mjibi). For quantities that include the MET in the event, one could consider the transverse 
mass of two or more objects. (See also the footnote above concerning the hadronically 
decaying top in fully reconstructed events.) These variables and their charge asymmetries 
are strongly correlated, but one might still obtain additional sensitivity by combining them. 
But here, for simplicity, having found that the most sensitive variable on its own is Mju,w, 
we will focus on it exclusively below. 

IV. EVENT SELECTION AND PROCESSING 

We mentioned earlier that the tij background and the W signal do interfere with each 
other. However we have explicitly checked that interference effects do not alter the differen- 
tial asymmetry in the Mj\bw mass variable by a significant amount (given currently expected 
statistical uncertainties). The effect on the total number of events is also small. Thus it is 
relatively safe for us — and for the early searches at the LHC — to neglect interference in 
the study of the mass variable, at least for the W model. (We have not studied whether this 
is true for all similar models with tX production.) At some point, higher-precision study 
with much larger data samples (^> 10 fb _1 ) may require the full set of interfering diagrams, 
and a special-purpose background-plus-signal simulation. Here we simulate background and 
signal independently. 

On the other hand, t-channel W exchange (Fig. [4]) makes an important contribution to 
the cross-section and should always be included when generating the signal sample. (This 
is not uniformly the case in the literature.) For the variables we are studying, the t-channel 
process does not contribute much to the asymmetry, and effectively acts as an additional 
background. 

A background sample and the signal samples for our benchmark points were generated 
with Madgraph 4.4.32 [90] and showered with PYTHIA 6.4.22 [91J. We performed a fast 
detector simulation with DELPHES 1.9 [93J. (For our parton-level studies the decays of 
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the top and the antitop were simulated with BRIDGE 2.24 [92]). We used the anti-fcr 

jet-clustering algorithm (with R = 0.5) to reconstruct jets. The isolation of leptons and 



the ATLAS collaboration |94j. We account for the rising P^-dependence of the 6-tagging 
efficiency, which reaches up to 60% in the kinematic regime of interest. The dependence 
of the b tagging efficiency on the pseudo-rapidity is assumed to be negligible within the rj 
reach of the tracker (\rj\ < 2.4), with the tagging rate taken to be zero outside the tracker. 
The c-tag efficiency was assumed a factor of 5 smaller and the mistag rate is taken to be 
1%. We do not account for the falloff in efficiency and the rise in mistag rates at higher Pt, 
since measurements of these effects are not publicly available; our tagging might therefore 
be optimistic, though the issue affects both signal and background efficiency. 
We impose the following criteria for our event selection^] 

• At least 5 jets with P T et > 30 GeV and \rj\ < 5 

• At least one of these jets is 6-tagged 

• One isolated lepton (e^ or /r* 3 ) with P T > 30 GeV and \r]\ < 2.5 



where rj stands for pseudo-rapidity as usual. We also impose a cut on St, which is defined 

as 



where the sum runs over all the jets with P T e > 30 GeV. The St cut will be at a high enough 
scale (typically 600-800 GeV) that our events will pass the trigger with high efficiency. 
The SM background simulation requires a matched sample for 

p + p — > t + t 
p + p^-t + i + j 

5 Our cuts may be optimistic in the rapidly changing LHC environment. Raising the jet Pt cut to 40 GeV 
results in a loss of sensitivity of order 10-20%. If one restricts jets to those with |ry| < 2.5, signal is reduced 
by about 10%, and background by about 15%. An increase in the electron Pt cut to 45 GeV reduces 
signal by 20-25% and background by about 30%. 



jets is described in Appendix |B 1 The 6-tagging was modeled after the SV050 tagger of 



• MET > 30 GeV. 




(2) 
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where we use the MLM scheme [95], with QCUT= 30 and xqcut= 20. The renormalization 
and factorization scales are set to %, where m T is the the geometric mean of w? t + p\ for 
the top and antitop. 

One might wonder whether it is necessary to include pp — > tijj as well. But we are 
requiring 5 hard jets, and the mass and angle variables we will study are not sensitive to soft 
radiated jets, as they involve the hardest jet and a 6-tagged jet. It is sufficient, therefore, 
for us to truncate our matched sample with one jet, and allow PYTHIA to generate any 
additional radiation. In total, we generated 3 million background events before matching. 
After matching, we find an inclusive ti LO cross-section of about 90 pb, so we include a 
K-factor of 1.7 to match with the NLO+NNLL QCD calculation [961 EZJ- The number of 
events we generated for background corresponds to about 14 fb _1 , large enough to provide 
smooth distributions for the variables we study. 

There are a number of SM processes whose total cross-sections for producing a lepton 
are intrinsically charge- asymmetric. These include single-top production and IV-plus-jets, 
for which an £ + is more likely than an t~ . However, these have small rates for 5 jets and 
a lepton, especially with a b tag required and with a hard St cut. Moreover, asymmetries 
from any such process would be quite different from the signal, being both structureless and 
everywhere positive. We foresee no problem with such backgrounds. 

For each value of the W mass and coupling constant, we generated a signal sample with 
750,000 events. No matching was used; extra ISR/FSR jets were generated by PYTHIA. 
These samples are large enough to suppress statistical fluctuations when we later use them 
to study the expected shape and magnitude of the asymmetry. In our studies, we have 
chosen to scale all LO signal cross-sections, for all six benchmark points, by a K-factor of 
1.7, the same as for the ti background^ Note that this K-factor can always be absorbed in 
gji, as long as the width of the W is smaller than the resolution. 



V. ANALYSIS AND RESULTS 



Although the parton-level charge asymmetries described in Sec. Ill are large, the exper- 



imentally observable asymmetries are significantly diluted by the detector resolution and 



6 We note that the K-factor for the process bg — > tW is in this range [IT], suggesting our choice is not 
unreasonable. 
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mis-reconstructions. Fig. [6] shows our estimate of the asymmetry structure that can be 
obtained at the detector level; compare this with Fig. [5] Note, however, that the basic 
structure of a negative asymmetry at the W peak, with a positive asymmetry to either side, 
remains intact. 

As always, one needs to obtain a prediction for both the Standard Model-only assumption 
(SM) and the Standard Model plus new physics assumption (NP), and assign a degree of 
belief to one or the other using a suitable statistical procedure, given the observed data. 
We will argue below that the SM prediction for the asymmetry in Mj^w is essentially zero, 
within the statistical uncertainties of the measurement. However, to predict the asymmetry 
in the presence of a signal requires a prediction of its dilution by the background. The 
background is also needed in order to predict the size of the fluctuations of the SM asymmetry 
around zero. 

Direct use of Monte Carlo simulation to model the SM background distribution would be 
a source of large systematic errors, as NLO corrections are not known, and since we impose 
a hard cut on St- We therefore propose a (partially) data-driven method, minimizing this 
systematic error while keeping the statistical errors under control. The result can then be 
combined with a signal Monte Carlo to predict the differential asymmetry in Mjiw The 
search for a signal will then involve fitting this expectation to the data. 

Our first task is to discuss how to obtain the prediction (which we will refer to as a "tem- 
plate") for the differential asymmetry in Mji bW , under both the SM and NP assumptions. 
We will begin by arguing that the SM asymmetry template is zero to a sufficiently good ap- 
proximation. Next we will make a proposal for a partially data-driven method to determine 
the template for a given NP assumption, with low systematic uncertainty. Finally, we will 
estimate the sensitivity of our variables, using a simplified statistical analysis based in part 
on our proposed method. Along the way we will find the preferred value of the St cut. 

A. The SM Template: Essentially Zero 

It is crucial for our measurement that the asymmetry in the SM background be known, so 
that the presence of a signal can be detected. It would be even better if the SM asymmetry 
is very small. Here we give evidence that this is indeed the case. 

It is essential to recognize that the SM background to the tij process is very different 
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(a)M jlbW for a 400 GeV W, g R = 1.5 
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(c)M jlbW for a 600 GeV W, g R = 2 
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(b)Bin-by-bin asymmetry for a 400 GeV W , 
9R = 1-5 
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(d)Bin-by-bin asymmetry for a 600 GeV W', 
9r = 2 
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(e)M jlbW for an 800 GeV W, g R = 2 



(f)Bin-by-bin asymmetry for an 800 GeV W, 
9R = 2 

FIG. 6. As in Fig. [5j but after accounting for detector effects, and with an St cut of 700 GeV, for 
three different W masses. All plots show signal plus background. The samples correspond to an 
integrated luminosity of 5 ftV 1 . 



from the SM background to the ti process. In ti, all asymmetries are zero at LO. The non- 
vanishing SM asymmetry in ti therefore arises from an NLO effect, involving both virtual 
corrections to ti and real emission, that is, tij. The asymmetry therefore cannot be studied 
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FIG. 7. A parton- level study of the SM background asymmetry for the mass variable with a 700 
GeV St cut, corrsponding to 12 fb _1 luminosity. Other simulations confirm that the asymmetry 
appearing at 1200 - 1400 GeV is a statistical fluctuation. 

at all with a leading-order event generator, and in a matched sample (which contains tij 
but not the virtual correction to tt) it would actually have the wrong sign. 

However, for tij itself, differential charge asymmetries at LO are not zero. The correction 
to these asymmetries from NLO corrections to tij are subleading in general. Therefore we 
can ask the following question of an LO generator: although the generic observable in tij 
events will show a charge asymmetry, is this the case for the Mjibw variable, or is any 
asymmetry washed out? 

We find that the asymmetry in the mass variable is consistent with zero, as one can see 
in Fig. [7j This also turns out to be true for the angle variable which we will discuss later. 
We emphasize that this was not guaranteed to be the case. One can find variables that, at 
LO and at parton-level, exhibit asymmetries. An example is the asymmetry between the Pt 
of the t and that of the i, which is of order 4% at parton-level. The fact that qg — > tij has 
rather small asymmetries, and that the symmetric gg initial state contributes significantly 
to tij, helps to reduce the size of any observable asymmetries. After reconstruction and 
detector effects, nothing measurable remains. 

We know of no reason why NLO corrections would change this conclusion. Neither virtual 
corrections nor real jet emission have any reason to strongly affect M^y/. For this reason 
we will treat the SM background as purely symmetric. 

No argument of this type is airtight. Fortunately, the experiments do not need to rely 
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entirely upon it. As we see in Fig. |6j the asymmetry in the signal has a characteristic 
kinematic structure. Moreover, related asymmetries will show up in several mass variables 
in a correlated way, due to the W, and one would not expect similar correlations in the 



background. Finally, a signal is likely also to appear in the angle variable discussed in Sec. VI 



The existence of these multiple cross-checks should allay any concerns that a measurement 
of a non-zero asymmetry might be uninterpretable. 



B. Obtaining NP Templates and Accounting for Fluctuations 

We now discuss how to obtain the NP template that is needed for each benchmark point. 
In addition one needs to be able to estimate the fluctuations that can occur under both the 
SM and NP assumptions. We emphasize the possibility of data-driven approaches. 

We will find it useful to introduce some notation (summarized in Table [n]) in which Sf 
and Bf represent, for a signal-only and background-only Monte Carlo sample, the number 
of events in bin i with a positively- or negatively-charged lepton £ ± . Df denotes the similar 
quantity in data (and is thus not generally equal to the expected result S i + Bf .) At 
some point we will need a smoothed version of the data, which we denote via [Df]. The 



Bf (Bf ) 


Number of positive (negative) lepton events in i th bin, for background-only 
Monte Carlo. 




As above, for signal-only Monte Carlo. 


Df (Bf) 


As above, in observed data. 


m m )] 


As above, in a fit to the observed data. 


At 


Predicted charge asymmetry the i th bin for a particular hypothesis. 


Ai 


Charge asymmetry in i th bin as observed in data. 


C-n 


Amplitude for best fit of an NP template to the n th pseudo-experiment under 
the SM hypothesis. 


C 


Amplitude for best fit of an NP template to the asymmetry observed in the 
data. 




Standard deviation of the c n . 



TABLE II. Notation used throughout Sec. [Vj 
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differential charge asymmetry predicted by the template for a particular benchmark point, 
or by the SM itself, we denote by Aj. Meanwhile, we call the observed asymmetry in the 
data Ai. 

Let us first focus on the statistical fluctuations around the template for the SM, which as 
we argued above in Sec. V A can be taken to be zero. Whenever one needs this template, it 
is under the assumption that the data is pure SM. Even without signal, there will be plenty 
of data with > 5 fb _1 and an St cut of order 700 GeV. It therefore appears that rather than 
obtain the fluctuations around zero using a Monte Carlo sample B iy one would have much 
smaller systematic errors using the data Di = Df + D~ itself. One could probably do even 
better using a fit [Di] to the data, smoothing the bin-by-bin fluctuations in the numbers of 
events. We believe that the remaining statistical uncertainties that come with this method 
of modeling background will be smaller than the systematic uncertainties on an LO Monte 
Carlo for Bi. From this data-driven model, one may determine the expected size of the 
fluctuations on A\ by performing a series of pseudo-experiments. 

Next let us consider how to determine the template A 1 for a particular NP hypothesis 
We could of course simply compute it from large Monte Carlo samples, with Monte Carlo 
integrated luminosity Cmc much larger than the integrated luminosity in data Cdata, for Si 
and Bi. 

q + _ q- 

A. = i * (Q\ 

l ~ St + S7+Bt + B7 [6) 



(Recall we are ignoring interference for nowjj) Here the Bf cancel in the numerator, since 
the asymmetry in the SM background is assumed to be zero. With this approach statistical 
errors can be made arbitrarily small, but systematic errors on the SM background prediction 
could be very substantial. The process tij has never previously been measured at these 
energies, and after the St cut it is difficult to estimate how large the systematic errors 
might be. Moreover we know of no way to extract the tij background reliably, in the 
presence of signal, without the potential for signal contamination. 

An alternative purely data-driven approach would be to use the suitably-fitted charge- 
symmetric data [Df + Df] in the denominator of (pi). For the numerator one may take a 



7 If interference cannot be neglected, as might happen with very large data sets or perhaps with other 
models that we have not explored in detail, then our separation of Si and Bi is naive. What must then 
appear in the numerator is the difference of positive and negative lepton events in the combined signal 
and background. Systematic errors will then presumably be somewhat larger. 
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large Monte Carlo sample for Si, and scale it to the luminosity of the data sample, giving 



(S+-S- 



dat a 



a > - %r^r (4) 

where again Cdata and Cmc are the luminosities of the data and the signal Monte Carlo 
sample. This method introduces correlations between the prediction of the template A 1 
and the measurement A 1 which would have to be studied and accounted for. However, the 
systematic error introduced by these correlations may in many cases be much smaller than 
those introduced by relying on a Monte Carlo simulation for the denominator, as in ([3]). In 
addition, statistical errors that arise from the finite amount of data, which would be absent 
with a large Monte Carlo sample, are negligible, as can be seen as follows. The statistical 
error on the predicted asymmetry A\ is dominated by fluctuations of the denominator of (Jij), 
since the statistical error on the numerator of Q can be made arbitrarily small by increasing 
£>mc : 

a(A t ) 1 



(5) 



However, for the measured asymmetry A4, defined as A; L = \ L , the error is always (for 
these models) dominated by the numerator: 



o-(Ai) _ 1 




(6) 



Ai (Dt + D-y/i 

More precisely, since the largest observed asymmetries per bin will be of the order of 0.15, 
the statistical error on the observed asymmetry is always larger than the statistical error on 
the template — o~(Aj) » cr(Aj). And again we emphasize that this data-driven method 
reduces systematic uncertainties from what is often the largest source: the lack of confidence 
that the tij background is correctly modeled. This comes at the relatively low cost of mild 
correlations between prediction and data, and some additional minor statistical uncertainty. 

Partially data-driven approaches are also possible. Even if one uses [Di\, the choice of 
fitting function could be determined in part with the use of Monte Carlos for Bi and S^. 
Interestingly, the distribution in the variable Mj\b\v is quite similar in signal and background, 
so the presence of signal, though it affects the overall rate, does not strongly affect the overall 
shape away from the W resonance. 

Since the pros and cons of these methods are luminosity-dependent, and dependent upon 
the details of the analysis, the only way to choose among these options is to do a study at the 
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time that the measurement is to be made. We therefore do not attempt any optimization 
here. Whatever method is used, the last step in the process in obtaining the NP template 
is to fit the Ai to a smooth function, which then serves as the template for the asymmetry 
in this particular benchmark point. (The size of the fluctuations around this template can 
again be obtained from [Di], as we suggested for the SM template.) After repeating this 
process for a grid of benchmark points, one may then compare the data to the SM null 
template or to any one of the NP templates. In the next subsection we will carry out a 
simplified version of this study, to investigate the effectiveness of our methods. 



C. Effectiveness of Our Method: A Rough Test 

A full evaluation of our method, carrying out precisely the same analysis that the experi- 
mentalists will need to pursue, would require more firepower than we have available. Instead 
we will carry out a somewhat simplified analysis, asking the following question: 

If the NP hypothesis for a certain benchmark point is realized in the data, what is the 
average confidence level at which we can reject the SM hypothesis? 

The answer to this question will serve two purposes. First, it will give a measure of how 
sensitive a complete analysis will be for distinguishing the SM from various NP scenarios. 
(More precisely, it will be slightly optimistic, as we will discuss, but not overly so.) Second, 
it will allow us to estimate what value of the St cut is optimal for different benchmark 
points. 

We have not yet said much about the St cut, so let us remark on it now. Without such a 
cut, the signal to background ratio in the tij sample is small, as small as 1:45 for mw 1 = 800 
GeV with gn = \f2. However, the situation can be much improved using the fact that the 
signal St distribution tends (especially for heavy W's) to sit at much larger values for signal 
than for the SM background. (See Fig. |8j note these plots show the St distributions for our 
large-gn benchmark points. From this one can see that a simple counting experiment would 
not be trivial.) The optimal value of the St cut depends on the model, the analysis method 
and the luminosity. For most of our purposes an St cut of the order of 700 GeV is suitable, 
as we will see later. 
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FIG. 8. St distributions of signal and background, for various benchmark points. For computa- 
tional reasons we did not simulate events with St < 450 GeV. The samples correspond to 8 fb _1 . 



Answering the italicized question posed above is equivalent to evaluating the probability 
for fluctuations about the SM assumption to create a differential asymmetry A4 that resem- 
bles the pattern predicted by the NP assumption A^. For this we need (a) the template A\ 
for the NP assumption and (b) an estimate of the size of the fluctuations that can occur 
under the SM assumption. 

We have discussed above how to obtain these things from the data at the LHC But 
since the actual data Di are not yet available, we obtain our NP template A\ from large Si 
and Bi Monte Carlo samples, using formula ([3]). Obtaining the fluctuations under the SM 
assumption is a bit subtle. Since in this section we are assuming the data itself contains 
a signal, our background model must be obtained, according to our data-driven strategy, 
from our simulation of Si + Bi (and not from Bi alone!) We take the expected numbers 
of positive- and negative- charge lepton events to both be equal to half of Si + Bi. We 
then study the fluctuations around this background model by performing 50 000 Poisson- 
fluctuating pseudo-experiments, for positive- and negative-lepton events independently, and 
computing the differential asymmetry for each one. 

Finally, to address our italicized question, we must then ask: what is the probability 
for fluctuations of the asymmetry around zero, given this background model, to resemble 
the "data"? This is done as follows: For each pseudo-experiment, we fit the differential 
asymmetry to the NP template Aj of our benchmark point, keeping the shape of the NP 
template fixed but allowing the amplitude to float. The best-fit amplitude we denote by c„, 
where the index n labels the pseudo-experiment. For illustration, some examples for a couple 



of pseudo-experiments are shown in Figs. 9(a) and 9(b) 
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(a)SM pseudo-experiment, with 
amplitude 0.39 



(c)NP pseudo-experiment, with 
amplitude 0.98 



(b)SM pseudo-experiment, with 
amplitude —0.16 

FIG. 9. Two examples of possible fluctuations of the differential charge asymmetry under the SM 
hypothesis, and one example under the NP hypothesis. The red line is the best fit of the amplitude 
of the NP template to the pseudo-experiment, with the shape held fixed. The NP template that 
was chosen corresponds to the 600 GeV W with gn = 2 and St > 700 GeV. The fluctuations are 
representative for a 5 fb _1 sample. 

Under the SM assumption (zero asymmetry), the expectation value of the c n is zero. 
(Similarly, under the correct NP assumption, the expectation would be 1.) The c n follow a 
Gaussian distribution, whose width gives the standard deviation a c of the c n around zero. 
If an amplitude of size c were observed in the data, the p-value (chance of a fluctuation on 
the SM hypothesis to produce a structure with amplitude c or larger) is then: 



P[X > c] 



1 



+oo 



dce-^ c/(7c)2 



(7) 



'2tio~ c j c 

To get a measure of typical significance, we compute P[X > 1], the probability for the SM 
to produce an resembling the template Aj with an amplitude c exceeding 1. (Recall that 
c = 1 would be the expected value given that nature has chosen this benchmark point.) The 
results of this procedure for our benchmark points, after conversion to standard deviations 



on a Gaussian, are displayed in Table III, for two integrated luminosities and for the optimal 
SV-cut (see below.) In Appendix |Al we also present contour plots of the significance as a 



function of the integrated luminosity and the St cut; see Figs. 12 and 13 



The amount by which the observed significance tends to fluctuate around the expected 
significance depends on the luminosity and the St cut. By running a different set of pseudo- 
experiments based on the NP hypothesis, we can obtain the Gaussian distribution of the 
amplitude of the fit. (An example of such a pseudo-experiment is shown in Fig. 9(c)[ ) Values 
for the width of this distribution give us the statistical error bar on the expected significance, 
and are included in Table IIHI 
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9R 


St cut 


Significance 


(GeV) 


(GeV) 


5 ft- 1 


8 ft)- 1 


400 


1.5 


750 


6.27 ±0.92 


7.49 ± 0.75 


400 


1.5 
V2 


750 


3.38 ±0.95 


4.24 ±0.95 


600 


2 


700 


3.42 ±0.92 


4.08 ± 0.95 


600 




700 


1.79 ±0.83 


2.15 ±0.86 


800 


2 


700 


2.37 ±0.87 


3.12 ±0.92 


800 




700 


1.60 ±0.82 


2.01 ±0.85 



TABLE III. Expected significance and statistical error for SM exclusion at our benchmark points, 
given selected luminosities and optimal St cuts. For the correct interpretation of these numbers, 
please refer to the text. 



Our simplified analysis is imperfect in various ways. One important weakness is that we 
assume that nature matches one of our benchmark points, and we do not consider the effect 
of using the wrong benchmark point in obtaining the exclusion of the SM. In particular, the 
mass of the W we used to obtain the NP template matches the mass of the W in our "data" . 
A finer grid in W mass would address this. (In general, the coupling g^ for the template 
will also differ from the real coupling, but except for its effect on the W width, often smaller 
than the experimental resolution, a change in the coupling affects the amplitude, but not 
the shape, of the corresponding template.) Also, our simplified procedure to fit only for 
the amplitude of the template and to keep the shape fixed does not always capture all the 
features of the asymmetry distribution, as is illustrated in Fig. |9(c) where the central dip 
in the asymmetry is deeper than our fit function can capture. In a more detailed study 
one might choose to let multiple parameters float to obtain a better fit. We further note 
that we are not accounting for the look-elsewhere effect. And finally, although the use of 
asymmetries and a data-driven method reduces systematic errors, we have not considered 
the remaining systematic errors here. 

On the other hand, there are important features of the signal that we are not using 
in our analysis, and including those would enhance the sensitivity. The use of several 
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(correlated) mass variables, and the angle variable discussed in the next section, would 
give some improvements. Moreover, while the charge asymmetry we focus on here has low 
systematic errors but is statistically limited, other observables with higher systematics but 
lower statistical errors, such as the differential cross-section with respect to St, are obviously 
useful as well. In any search for this type of models multiple approaches should be combined. 



VI. AN ANGLE VARIABLE 



In this section we discuss another charge-asymmetric variable, the azimuthal angle be- 
tween the hardest jet without a 6-tag (ji) and the lepton I: 

A<f> jlit = Min[|0,- 1 - <f>i\, 2tt - \<f) h - <^|] . (8) 

With a low St cut, the angle between the hardest jet and an l~ tends to be larger than 
the angle between the hardest jet and an £ + [Figs. 10(a) and 10(b)] . The reason is as follows: 



The W is produced near threshold, so the recoiling top quark or antiquark is not highly 
boosted. The top from the W decay, on the other hand, will recoil back-to-back against the 
d or d (which is usually the source of the hardest jet). Moreover, this top will be somewhat 
boosted since raw 1 3> m t, so if it decays leptonically, the lepton's momentum tends also to 
be back-to-back to the d or d. This results in a large opening angle between the hardest jet 
and the lepton. However, if it is the other top quark that decays leptonically, the angle of 
its lepton with the hardest jet is more randomly distributed. Since the negatively charged 
W is produced more abundantly, this variable will exhibit a charge asymmetry. 

For a high St cut the picture reverses. The W and the top from which it recoils are now 
both boosted and typically back-to-back with one other. The decay products from the W 
tend to be aligned with each other. In other words, a cluster of four objects (from the W) 
is now recoiling against a cluster of three objects (the top). The hardest jet is typically still 
the down quark from the W decay. If the lepton's parent is the top from the W, Afy^ 
tends to be small, while the opposite is true if the lepton comes from the recoiling top. (See 



Figs. 10(c) and 10(d).) 



This reversing structure in the asymmetry as a function of the St cut is useful, as it 
potentially provides a very strong hint of new physics. However, there is an intermediate 
St cut where the asymmetry is essentially zero, so in that range the variable is not useful. 
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(b)Asymmetry of the angle variable in signal only 
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(d)Asymmetry of the angle variable in signal only 
for M W ' = 400 GeV and g R = 1.5. 



FIG. 10. Angle difference between the lepton and the hardest jet at parton-level signal-only, for 
W's of mass 800 and 400 GeV with an St cut at 700 GeV. The samples correspond to 5 fb _1 . 



For this reason, we recommend studying this variable as a function of the St cut. 

We explicitly checked that the standard model will not introduce a large asymmetry in 



this angle variable, for any St cut. A particular case is shown in Fig. [TTJ Our reasoning for 
trusting a LO Monte Carlo is the same as was described in Sec. V A| for the mass variable. 

An interesting feature of this angle variable in the W model (though whether this is true 
in other models has not yet been studied) is that the point where the number of positive and 
negative lepton events is roughly equal is insensitive to mw> and g^. For all our benchmark 
points we find A(/>j u i ~ 2 to be a suitable place to break the signal into two bins. The 



detector-level asymmetries in both bins are given in Table |IV[ To estimate the significance, 
we follow a strategy similar to the one mentioned for the mass variable. However instead of 
fitting for the amplitude of a previously obtained template, we compute the difference of the 
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(a)The angle variable for SM background with 
a 700 GeV S T cut. 



(b)Bin-by-bin asymmetry in the angle variable for 
SM background with a 700 GeV S T cut. 

FIG. 11. A parton- level study on SM background asymmetry for the angle variable with a 700 
GeV St cut, corresponding to 12 fb _1 luminosity. 
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Asymmetry (%) 


(GeV) 
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1 st bin 
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TABLE IV. Expected asymmetry at detector-level in the angle variable for each superbin, for our 
benchmark points using the optimal St cut. 



asymmetry of the two superbins and establish the Gaussian probability distribution for this 
variable using pseudo-experiments on the SM hypothesis. Plots of the resulting significance 
of this observable as a function of St cut and luminosity can be found in Figs. 14 and 15 in 
Appendix [A} 

The greatest merit of the angle variable is its simplicity. Both the hardest jet and the 
lepton are well-measured, and in contrast to the mass variables no (partial) event reconstruc- 
tion is needed. Unfortunately the angle variable is more sensitive than Mjuw to interference 
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effects between signal and background. Whether the contribution from interference is pos- 
itive or negative depends on the mass of new particle, the St cut and the model we study. 
The effect, however, appears to be only moderate. We find that, for a W mass of 800 GeV 
and an St cut of 700 GeV, the asymmetry for the two bins after interference is included is 
reduced by about 15%. A more detailed study including interference is advisable to give a 
precise estimate of its effects, especially for other models where interference might be more 
important. 



VII. FINAL REMARKS 

At the LHC, models that attempt to explain the Tevatron ti forward-backward asymme- 
try with the exchange of a particle X in the t- or ^-channel generate a charge-asymmetric 
signal in tX production. This leads to observable charge asymmetries in certain variables 
within ttj samples. Among interesting observables are mass variables involving various final 



state objects including the hardest jet and/or the lepton (Sees. Ill and M), the azimuthal 



angle between the lepton and the hardest jet (Sec. VI) and the Pt difference between the 
tops and W bosons (Appendix O) . Of these variables, the invariant mass of the hardest jet, 
the leptonic W and a 6-tagged jet appears to be the most powerful and the most universal, 
since it tends to reconstruct the W mass resonance. The charge asymmetry of this variable 
exhibits a negative asymmetry in the region of the W mass, and a positive asymmetry 
elsewhere. We have proposed a data-driven method to extract a statistical significance from 
this asymmetry structure. 

One could of course go further by fully reconstructing the events, and directly observe 
that W'~ production is larger than W' + production. However demanding full reconstruc- 
tion would lead to a considerable loss of efficiency. Since we cannot realistically estimate 
this efficiency loss, we cannot evaluate the pros and cons of this approach, but clearly the 
experiments should do so. 

We have described this asymmetry measurement on its own, without discussing the fact 
that simultaneously the experiments will be measuring charge-symmetric variables, such as 
the cross-section for ttj as a function of St- Of course these variables are complementary, 
and we do not in any way mean to suggest that one should do one instead of the other. 
Charge-symmetric variables may often have lower statistical uncertainties, but in most cases 
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background-subtraction is necessary, so there will be large systematic errors. The combina- 
tion of the two types of measurements will help clarify the situation far better than either 
one could in isolation. Additional information will come from the differential charge asym- 
metry in ti events at the LHC, which is a direct test of the Tevatron measurement of the tt 
forward-backward asymmetry, and is sensitive to any growth of the effect with energy. 

A very important aspect of our approach is that the asymmetry is a diagnostic for models. 
An s-channel mediator will not generate a peak for either lepton charge, and so even if an 
asymmetry in tij were generated, it would be largely washed out in the variable Mj^w- 
Among models with t- or w-channel mediators X, some will produce a negative asymmetry 
at Mjuw = mx, while others will produce a positive asymmetry. For example, models that 
replace the W by a color triplet or color sextet scalar X [67H70] that couples to ur and 
tn (and has charge 4/3) will have the opposite sign, because the process ug —> iX + will be 
larger than ug — > tX~. The approach we use will still apply, but the asymmetry will be 
positive in the neighborhood of the X mass peak, rather than negative as it is for the W. 
For this reason, even if it turns out that the asymmetry measurement is not needed for a 
discovery of the X particle, it will still be an essential ingredient in determining its quantum 
numbers and couplings. 

What seems clear from our results is that the data already available (or soon to be 
available) at the 7 TeV LHC should be sufficient to allow for an informative measurement 
of charge- asymmetric observables in tij to be carried out. We look forward to seeing studies 
of tij from ATLAS and CMS, and we hope that measurements of charge asymmetries will 
be among them. 
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Appendix A: Additional Results 



1. Contour plots for the mass variable 



As can be seen in Figs. 12 and 13, we find that the optimal Sr-cut for the mass variable 
does not vary greatly with luminosity, or even with the W mass: it lies around 700 GeV 
for the 600 GeV and 800 GeV W and is slightly higher for the 400 GeV W. At lower S T 
cuts, reduced signal-to-background ratio worsens the significance. The reason a large St cut 
works well even for low W mass is that the distribution for the charge-symmetric component 
of the signal (mainly t-channel W exchange) peaks at low St for a lighter W . Meanwhile, 
for an overly high St cut the remaining signal is too small. But we should mention that our 
binning procedure makes our results too pessimistic here. 

When producing these contour plots, we choose a fixed binsize of 50 GeV everywhere 
except in the upper and lower tails of the distribution, where we use a superbin. The 
superbins are sized so that that no bin ever contains fewer than 50 events. For higher St, 
there are very few bins between the two superbins, and this makes the peak-valley-peak 
structure weak, ruining the significance of the measurement. Within the white region in the 
upper left of the plots, the number of events is so small that no bin with more than 50 events 
exists, and our binning strategy gives a null result. However, for a high St cut one could 
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FIG. 12. Expected significance of the Mjibw variable for a 400 GeV W , as a function of luminosity 
and St cut. 
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choose a more sophisticated binning strategy. We have verified in a few particular cases that 
larger bins for higher St cuts can restore some of the significance of the measurement. All 
of this is to say that sophisticated treatment of the data may lead to a somewhat better 
result than our simple-minded binning strategy would suggest. 
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FIG. 13. Expected significance of the MjxhW variable for a 600 GeV and an 800 GeV W', as a 
function of luminosity and St cut. 
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2. Contour plots for the angle variable 



The plots below show the significance for exclusion of the SM hypothesis using the angle 
variable, along the lines of our method used for the mass variable. Note the band of low 
significance for the W with mass of 600 GeV, caused by the shifting structure that we 



emphasized in Sec. |VI[ for an St cut of around 700 GeV, the asymmetry shifts from one 
sign to the other. A study exploiting this dependence of the asymmetry on the St cut would 
have larger significance, but we have not explored this option here. 
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FIG. 14. Expected significance of £ for a 400 GeV and a 600 GeV W, as a function of 
luminosity and St cut. For the 600 GeV W the dark band corresponds to the range of St cuts 
where the asymmetry is changing sign, which results in a much reduced sensitivity. Interference 
between signal and background is not accounted for. 
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FIG. 15. Expected significance of Afi^g for an 800 GeV W, as a function of luminosity and St 
cut. Interference between signal and background is not accounted for. 

Appendix B: Strategy Details 

1. Isolation Procedure 

The detector simulation DELPHES produces particle candidates and requires the user 
to impose the isolation criteria of his or her choice. Hence for each lepton candidate in the 
DELPHES output there will be a corresponding jet candidate, and it is up to the user to 
decide which one to include in the analysis. To facilitate this choice, DELPHES provides 
the user with the following variables for each lepton: 

• UPt'- The sum of the Pt of all the tracks with Pt > 0.9 GeV in a cone of AR = 0.3 
around the leading track, excluding that track. 

• p\. The sum of the energy deposited in a 3x3 calorimeter grid around the leading 
track, divided by the Pt of that track. 

Here we lay out the isolation criteria we imposed on the various particle candidates. An 
isolated electron is defined as an electron candidate for which £Pr < 10 GeV, EPr- < 0.15P T 
and p e < 1.15. For isolated muons we require EPr < 10 GeV, EPr < 0.15P^ and p M < 0.15. 
Finally jet candidates are retained if no isolated leptons are found in a cone of radius 0.3. 
When a previously isolated lepton is found in a 0.3 cone, the jet candidate is identified 
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with the lepton and therefore removed from the event. We hereby impose two consistency 
conditions: 

• No more than 1 isolated lepton is found in a 0.3 cone 

• When one isolated lepton is found, the Pt of the jet candidate can differ by no more 
than 10% from the Pt of the isolated electron. 

When one of these criteria is not met, we are unable to carry out a consistent isolation 
procedure and the entire event is thrown out. The efficiency of our isolation procedure is 
97%, both for signal and background samples. 



Appendix C: The Py-difference variables 

Among other variables that show charge-asymmetries, ones of possible further interest 
include the difference in Pt between the t and the i, or between the positive and negative 
W bosons. 

Since one top quark is recoiling against the W, while the other top quark is a decay 
product of the W, one would expect their kinematics to differ. The Pt difference between 
the t and i is a variable in which this feature of the signal will manifest itself. The same is 
true for the W bosons from the t and i decays. For each event, we can calculate 

AP T , W = ^ ~ P ™~ m d AP T ,, = ^i^. (CI) 

The charge asymmetry at parton-level for these variables can be seen (for pure signal) in 



Figs. 16(a) and 16(b) 



Although spectacular at parton-level, we find that the Pt difference between the top 
quarks gets washed out a lot at detector-level by resolution effects and mis- reconstructions. 
Nevertheless we encourage experimental colleagues to take this variable in consideration, 
since state-of-the-art top reconstruction methods might alleviate this problem. The Pt 
difference between the W bosons is less pronounced at parton-level, but does survive our 
detector simulation and the reconstruction of the hadronic W . We find it is particularly 
useful for a low mass W. Like the angle variable, it changes sign as a function of the St 
cut. 
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FIG. 16. Top quark and W boson Pt difference at parton-level in signal with a 400 GeV W with 
St cut at 700 GeV. The sample is corresponding to a luminosity of 5 fb . 



We have not studied the effect of interference on these Pt variables. Whether the asym- 
metry from the SM tij background is important also requires further study. 
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